Multi-index regression models with missing covariates at random

نویسندگان

  • Xu Guo
  • Wangli Xu
  • Lixing Zhu
چکیده

AMS subject classifications: 62H12 62G20 Keywords: Covariates missing at random Inverse selection probability Multi-index model Single-index model a b s t r a c t This paper considers estimation of the semiparametric multi-index model with missing covariates at random. A weighted estimating equation is suggested by invoking the inverse selection probability approach, and estimators of the indices are respectively defined when the selection probability is known in advance, is estimated parametrically and nonparametrically. The consistency is provided. For the single-index model, the large sample properties show that the estimators with both parametric and nonparametric plug-in estimations can play an important role to achieve smaller limiting variances than the estimator with the true selection probability. Simulation studies are carried out to assess the finite sample performance of the proposed estimators. The proposed methods are applied to an AIDS clinical trials dataset to examine which method could be more efficient. A horse colic dataset is also analyzed for illustration. The multi-index regression model (MIM) for the scalar outcome variable Y and covariate X of dimension p has the form Y = g(θ τ 0 X) + ε, (1) where g(·) is the unknown mean function defined on R d , and θ 0 is an unknown regression parameter matrix of dimension p × d. In this paper, we assume the dimension d is known in advance. The conditional expectation of ε given X equals zero, and the superscript τ in (1) denotes transposition. For identifiability consideration, we assume that the direction parameter θ 0 satisfies θ τ 0 θ 0 = I d an identity matrix. When d = 1, the model is the well-known single-index model (SIM). In this case, the first component of θ 0 is assumed to be positive without loss of generality. The multi-index model is widely used, in several areas such as statistics and econometrics, as a reasonable compromise between fully parametric and fully nonparametric modeling. A remarkable amount of research has already been carried out for statistical inference in the context of completed samples. For estimation of the parameter θ 0 as well as of the link function g(·), there are a number of proposals in the literature. For multi-index and more general models, the examples of methods are sliced inverse regression (SIR, Li, [9]), sliced average variance estimation (SAVE, Cook and Weisberg, [3]). Zhu and Ng [23] and Zhu and Fang [22] gave general results …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مقایسه روش بیزی (Bayesian) و کلاسیک در برآرد پارامترهای مدل رگرسیون لجستیک با وجود مقادیر گمشده در متغیرهای کمکی

Background and Aim: Logistic regression is an analytic tool widely used in medical and epidemiologic research. In many studies, we face data sets in which some of the data are not recorded. A simple way to deal with such "missing data" is to simply ignore the subjects with missing observations, and perform the analysis on cases for which complete data are available. Materials and Methods: We c...

متن کامل

Conditional and unconditional categorical regression models with missing covariates.

We consider methods for analyzing categorical regression models when some covariates (Z) are completely observed but other covariates (X) are missing for some subjects. When data on X are missing at random (i.e., when the probability that X is observed does not depend on the value of X itself), we present a likelihood approach for the observed data that allows the same nuisance parameters to be...

متن کامل

Inference using conditional logistic regression with missing covariates.

When there are many nuisance parameters in a logistic regression model, a popular method for eliminating these nuisance parameters is conditional logistic regression. Unfortunately, another common problem in a logistic regression analysis is missing covariate data. With many nuisance parameters to eliminate and missing covariates, many investigators exclude any subject with missing covariates a...

متن کامل

Bayesian Single Index Model with Covariates Missing at Random

Bayesian single index model is a highly promising dimension reduction tool for an interpretable modeling of the non linear relationship between the response and its predictors. However, existing Bayesian tools in this area suffer from slow mixing of the Markov Chain Monte Carlo (MCMC) computational tool and also lack the ability to deal with missing covariates. To circumvent these practical pro...

متن کامل

Handling Missing Covariates in Conditional Mixture Models Under Missing at Random Assumptions.

Mixture modeling is a popular method that accounts for unobserved population heterogeneity using multiple latent classes that differ in response patterns. Psychologists use conditional mixture models to incorporate covariates into between-class and/or within-class regressions. Although psychologists often have missing covariate data, conditional mixtures are currently fit with a conditional lik...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Multivariate Analysis

دوره 123  شماره 

صفحات  -

تاریخ انتشار 2014